Demographic Factors Improve Classification Performance

Author

  • Dirk Hovy
Abstract

Extra-linguistic factors influence language use, and are accounted for by speakers and listeners. Most natural language processing (NLP) tasks to date, however, treat language as uniform. This assumption can harm performance. We investigate the effect of including demographic information on performance in a variety of text-classification tasks. We find that by including age or gender information, we consistently and significantly improve performance over demographic-agnostic models. These results hold across three text-classification tasks in five languages.

Similar resources

Increasing the accuracy of the classification of diabetic patients in terms of functional limitation using linear and nonlinear combinations of biomarkers: Ramp AUC method

The Area under the ROC Curve (AUC) is a common index for evaluating the ability of the biomarkers for classification. In practice, a single biomarker has limited classification ability, so to improve the classification performance, we are interested in combining biomarkers linearly and nonlinearly. In this study, while introducing various types of loss functions, the Ramp AUC method and some of...

Social Collateral and Repayment Performance: Evidence from Islamic Micro Finance

In this study, we test the remarkable repayment performance of Akhuwat in Pakistan, the most successful Islamic Microfinance Institution (IMFI), which offers interest-free loans in order to improve the quality of life and alleviate poverty. The model of Akhuwat is based on Muakhaat (brotherhood) and Qard-e-Hasan (offering financial assi...

Green envelopes classification: the comparative analysis of efficient factors on the thermal and energy performance of green envelopes

This paper classifies green envelopes as green roofs and green walls according to effective factors, which were derived from literature to compare the green envelopes’ thermal and energy performance in a more effective way. For this purpose, an extensive literature review was carried out by searching keywords in databases and studying related journal papers and articles. The research meth...

Factors Affecting the Elderly's Quality of Life in the Middle East: A Systematic Review

Aims: Identifying the factors affecting the older adult's quality of life can be effective in finding ways to improve their quality of life. Therefore, this study aimed to investigate the factors affecting older adults' quality of life in Middle Eastern countries. Information & Methods: This systematic review study was conducted in March and April 2020. According to the World H...

USING DISTRIBUTION OF DATA TO ENHANCE PERFORMANCE OF FUZZY CLASSIFICATION SYSTEMS

This paper considers the automatic design of fuzzy rule-based classification systems based on labeled data. The classification performance and interpretability are of major importance in these systems. In this paper, we utilize the distribution of training patterns in decision subspace of each fuzzy rule to improve its initially assigned certainty grade (i.e. rule weight). Our approach uses a punish...


Publication date: 2015